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5 1998. 
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FIELD OF THE INVENTION 

The subject invention is directed generally to a 
15 genetic assay, and more particularly to a genetic assay 
for protein nuclear transport, including nuclear import 
and nuclear export . 



BACKGROUND OF THE INVENTION 

2 0 Throughout this application various publications are 

referenced, many in parenthesis. Full citations for each 
of these publications are provided at the end of the 
Detailed Description. The disclosures of each of these 
publications in their entireties are hereby incorporated 
25 by reference in this application. 

Nucleo-cytoplasmic shuttling of protein molecules is 
a basic biological process central to the regulation of 
gene expression (which underlies all aspects of 
development, morphogenesis, and signaling pathways in 

3 0 eukaryotic organisms) . Furthermore, transport of proteins 

and protein-nucleic acid complexes in and out of the 
nucleus is an essential step in many host-pathogen 
interactions such as viral and bacterial infection. 
Nuclear traffic occurs exclusively through the nuclear 
35 pore complex (NPC) . While small molecules (up to 40-60 
kDa) diffuse through the NPC, nuclear import of larger 



molecules is mediated by specific Nuclear Localization 
Signal (NLS) sequences contained in the transported 
molecule (Garcia-Bustos et al . 1991; Dingwall 1991) . 
Most NLSs can be classified in three general groups: (i) 
5 a monopartite NLS exemplified by the SV40 large T antigen 
NLS (SEQ ID N0:3: PKKKRKV) ; (ii) a bipartite motif 
consisting of two basic domains separated by a variable 
number of spacer amino acids and exemplified by the 
Xenopus nucleoplasmin NLS (SEQ ID NO : 4 : 

10 KRXXXXXXXXXXKKKL) ; and (iii) noncanonical sequences such 
as M9 of the hnRNP Al protein, the influenza virus 
nucleoprotein NLS, and the yeast Gal4 protein NLS 
(Dingwall and Laskey 1991) . 

Once in the nucleus, many proteins are transported 

15 back to the cytoplasm as an essential step in their 
biological function. For example, the Rev protein of 
human immunodeficiency virus type 1 (HIV-1) exits the 
nucleus, facilitating export of the unspliced viral RNA 
(Pollard and Malim 1998) . Rev nuclear export is mediated 

2 0 by a specific Nuclear Export Signal (NES) consisting of 
the leucine-rich sequence, SEQ ID NO : 5 : LPPLERLTL, found 
also in proteins of other viruses (Dobbelstein et al . 
1997) . Also, numerous cellular proteins, such as I-kB 
and MAPKK, contain potential NES sequences which may 

25 regulate the biological activity of these proteins by 
controlling their nuclear export (Ullman et al . 1997). 

The relatively small size of the NLS and NES 
sequences and, more importantly, the lack of clear and 
consistent consensus motifs in these signals, make it 

30 difficult to predict their presence in a given protein 
based solely on the analysis of its amino acid sequence. 
Furthermore, even if a consensus NLS or NES were found, 
it may not represent a functional signal. For example, 
P -glucuronidase (GUS) , a commonly-used reporter enzyme 



which resides exclusively in the cell cytoplasm (Varagona 
et al . 1991; Citovsky et al . 1992), carries a perfect, 
albeit non- functional , bipartite NLS at its carboxy 
terminus. Thus, the only practical way to identify 
5 active NLS or NES signals is by microinj ecting (Guralnick 
et al. 1996; Goldfarb et al . 1986; Kalderon et al . 1984) 
or expressing the protein of interest in eukaryotic cells 
(Varagona et al . 1991; Citovsky et al . 1992; Robbins et 
al. 1991; Roberts et al . 1987), heterokaryon formation 

10 (Michael et al . 1995), or using an in vitro transport 
system (Ossareh-Nazari et al . 1997; Schlenstedt et al . 
1993; Newmeyer et al . 1988; Ballas and Citovsky 1997). 
Two major experimental approaches have been developed in 
this regard. In one approach, the protein of interest is 

15 labeled, microinj ected into eukaryotic cells, and its 
intracellular localization determined. In another 
approach, the tested genes are fused to a reporter 
( |3-galactosidase , green fluorescent protein, etc.), 
expressed in eukaryotic cells, and the localization of 

2 0 the resulting fusion protein determined. Both methods 

have serious technical disadvantages. The first approach 
is very labor-intensive and requires highly trained 
personnel experienced in protein purification, 
microinjection, and fluorescent or electron microscopy 
25 techniques. The second method is also very laborious, 
involving often elaborate procedures for genetic 
transformation of higher eukaryotic cells and microscopy 
observations. Since both of these procedures rely on 
physical intracellular localization of the protein, 

3 0 common artifacts such as perinuclear binding can present 

problems in analysis of results. 

A need continues to exist, therefore, for a method 
for determining whether newly- cloned genes may encode a 
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protein that localizes to or is exported from the cell 
nucleus . 

SUMMARY OF THE INVENTION 

5 The subject invention addresses this need by methods 

and compositions for determining the presence of a 
nuclear localization signal or a nuclear export signal in 
a protein of interest. 

In regard to the nuclear localization signal, the 

10 invention provides a method of determining the presence 
of a nuclear localization signal in a protein of 
interest. The method comprises: selecting a host cell 
for use in the method, wherein the host cell contains a 
nucleus having nucleic acid encoding a reporter gene 

15 therein and wherein the host cell has a first level of 
expression of the reporter gene; identifying a DNA 
binding domain and an activation domain for the reporter 
gene; constructing a chimeric nucleic acid encoding a 
fusion protein comprising the DNA binding domain, the 

20 activation domain, and a protein of interest, wherein 

elements of the fusion protein other than the protein of 
interest have no nuclear localization signals; 
introducing the chimeric nucleic acid into the host cell; 
and determining a second level of expression of the 

25 reporter gene to determine the presence of a nuclear 
localization signal in the protein of interest. 

The invention further provides a recombinant host 
cell comprising: a nucleus having nucleic acid encoding a 
reporter gene therein; and a chimeric nucleic acid 

30 encoding a fusion protein, the fusion protein comprising 
a DNA binding domain for the reporter gene, an activation 
domain for the reporter gene, and a protein of interest, 
wherein elements of the fusion protein other than the 
protein of interest have no nuclear localization signals. 



Further provided is a chimeric nucleic acid encoding 
a fusion protein, the fusion protein comprising a DNA 
binding domain for a reporter gene, an activation domain 
for the reporter gene, and a protein of interest, wherein 
5 elements of the fusion protein other than the protein of 
interest have no nuclear localization signals. A vector 
comprising the chimeric nucleic acid molecule, as well as 
a kit comprising the vector, are also provided. 

Additionally provided is a nucleic acid molecule 

10 encoding a modified LexA protein, wherein the modified 
LexA protein has no nuclear localization signal, as well 
as a modified LexA protein, wherein the modified LexA 
protein has no nuclear localization signal. 

In regard to the nuclear export signal, the 

15 invention provides a method of determining the presence 
of a nuclear export signal in a protein of interest. The 
method comprises: selecting host cells for use in the 
method, wherein each of the host cells contain a nucleus 
having nucleic acid encoding a reporter gene therein; 

2 0 identifying a DNA binding domain and an activation domain 
for the reporter gene; constructing a chimeric nucleic 
acid encoding a fusion protein comprising the DNA binding 
domain, the activation domain, and a nuclear localization 
signal, wherein elements of the fusion protein have no 

25 nuclear export signals; introducing the chimeric nucleic 
acid into one of the host cells; determining a first 
level of expression of the reporter gene; constructing a 
second chimeric nucleic acid encoding a second fusion 
protein comprising the DNA binding domain, the activation 

30 domain, the nuclear localization signal, and a protein of 
interest; introducing the second chimeric nucleic acid 
into another one of the host cells; and determining a 
second level of expression of the reporter gene to 



determine the presence of a nuclear export signal in the 
protein of interest. 

The invention further provides a recombinant host 
cell comprising: a nucleus having nucleic acid encoding a 
5 reporter gene therein; and a chimeric nucleic acid 

encoding a fusion protein, the fusion protein comprising 
a DNA binding domain for the reporter gene, an activation 
domain for the reporter gene, and a nuclear localization 
signal, wherein elements of the fusion protein have no 

10 nuclear export signals. 

Further provided is a chimeric nucleic acid encoding 
a fusion protein, the fusion protein comprising a DNA 
binding domain for a reporter gene, an activation domain 
for the reporter gene, and a nuclear localization signal, 

15 wherein elements of the fusion protein have no nuclear 

export signals. A vector comprising the chimeric nucleic 
acid molecule, as well as a kit comprising the vector, 
are also provided. 

More particularly, the invention provides a simple 

2 0 functional assay for protein nuclear import and export 
which circumvents all of the above mentioned 
difficulties. This assay has been used to demonstrate 
the nuclear import and export activities of a capsid 
protein (CP) from a plant geminivirus, suggesting a role 

2 5 for CP in nuclear shuttling of viral genomes during the 

infection process. The simple genetic system is used to 
detect active nuclear import (NLS) and export targeting 
signals (NES) based on their function within yeast cells. 
To generate one embodiment of this system, a gene 

3 0 encoding the bacterial LexA protein was modified (mLexA) 

to abolish its intrinsic nuclear targeting activity and 
fused to a sequence coding for the activation domain of 
the yeast Gal4 protein (Gal4AD) in the absence or 
presence of the SV40 large T-antigen NLS. In the nuclear 



import assay, if a protein of interest fused to the 
mLexA-Gal4AD hybrid contains a functional NLS, the fusion 
product will enter the yeast cell nucleus and activate 
the expression of reporter genes . In the nuclear export 
5 assay, if a protein of interest fused to the mLexA-SV40 
NLS-Gal4AD hybrid contains a functional NES , the fusion 
product localized to the cell nucleus will exit into the 
cytoplasm, decreasing the reporter gene expression 
levels. This system was tested using proteins with known 

10 NLS and NES sequences and then the system was utilized to 
identify an NES within the capsid protein of a plant 
geminivirus. The results indicate that this system is 
applicable as a general method to identify and 
quantitatively analyze functional NLS and NES as well as 

15 to specifically select for proteins containing these 
signals . 



BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features and advantages of this 
invention will be evident from the following detailed 
description of preferred embodiments when read in 
conjunction with the accompanying drawings in which: 

Fig. 1 is a map of pGAD424; 

Fig. 2 is a map of pBTM116; 

Fig. 3 is a map of pEE2 ; 

Fig. 4 is a map of pED2 ; 

Fig. 5 is a map of pmLexA : : GAL4AD ( - ) NLS ; 

Fig. 6 is a map of pmLexA : : GAL4AD ( - ) NLS : : VirE2 ; 

Fig . 7 is a map of pmLexA : : GAL4AD ( - ) NLS : : VirD2 ; 

Fig. 8 is a map of pmLexA : : GAL4AD ( +) NLS : : VirE2 ; 

Fig. 9 is a map of pmLexA : : GAL4AD ( - ) NLS : : 2DriV; 

Fig. 10 is a map of pLG, showing the fusion protein 
construct used for the one-hybrid protein nuclear import 
assay; 
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Fig. 11 illustrates the results of the (3- 
galactosidase assay used to detect nuclear import of a 
tested protein; 

Fig. 12 illustrates the results of the selective 
5 reporter gene (HIS3) used to detect nuclear import of a 
tested protein; 

Fig. 13 is a map of pNEA; 

Fig. 14 is a map of pNEA: :Rev; 

Fig. 15 is a map of pNEA::VirE2; 
10 Fig. 16 is a schematic representation of pNIA and 

pNEA plasmids; 

Fig. 17 illustrates the wild type LexA NLS and the 
modified LexA NLS; 

Fig. 18 illustrates the results of a (3-galactosidase 
15 assay used to detect nuclear import of a tested protein; 

Fig. 19 illustrates the results of the selective 
reporter gene (HIS3) used to detect nuclear import of a 
tested protein (medium deficient for both tryptophan and 
histidine) ,- 

20 Fig. 20 illustrates the results of a [3-galactosidase 

assay used to detect nuclear export of a tested protein; 

Fig. 21 illustrates the results of the selective 
reporter gene (HIS3) used to detect nuclear export of a 
tested protein (medium deficient for both tryptophan and 
25 histidine) ; 

Fig. 22 illustrates the results of the selective 
reporter gene (HIS3) used to detect nuclear export of a 
tested protein (medium deficient for only tryptophan) ; 
Fig. 23 illustrates the results of the selective 
30 reporter gene (HIS3) used to detect nuclear export of the 
capsid protein (CP) of tomato yellow leaf curl virus 
(TYLCV) (medium deficient for both tryptophan and 
histidine) ; and 



Fig. 24 illustrates the results of the selective 
reporter gene (HIS3) used to detect nuclear export of the 
CP of TYLCV (medium deficient for only tryptophan) . 



5 DETAILED DESCRIPTION OF THE INVENTION 

Abbreviations ; PCR, polymerase chain reaction; 
mLexA, modified LexA; NIA, nuclear import assay; NEA, 
nuclear export assay; Gal4AD, Gal4 activation domain; 
HIV, human immunodeficiency virus; NLS, nuclear 
10 localization signal; NES, nuclear export signal; ORF, 

open reading frame; TYLCV, tomato yellow leaf curl virus; 
3 AT, 3 -amino- 1 , 2 , 4 -triazole . 

The subject invention provides a method of 
determining the presence of a nuclear localization signal 
15 in a protein of interest. The method comprises: 

selecting a host cell for use in the method, wherein the 
host cell contains a nucleus having nucleic acid encoding 
a reporter gene therein and wherein the host cell has a 
first level of expression of the reporter gene; 

2 0 identifying a DNA binding domain and an activation domain 

for the reporter gene; constructing a chimeric nucleic 
acid encoding a fusion protein comprising the DNA binding 
domain, the activation domain, and a protein of interest, 
wherein elements of the fusion protein other than the 
25 protein of interest have no nuclear localization signals; 
introducing the chimeric nucleic acid into the host cell; 
and determining a second level of expression of the 
reporter gene to determine the presence of a nuclear 
localization signal in the protein of interest. 

3 0 The invention further provides a recombinant host 

cell comprising: a nucleus having nucleic acid encoding a 
reporter gene therein; and a chimeric nucleic acid 
encoding a fusion protein, the fusion protein comprising 
a DNA binding domain for the reporter gene, an activation 



domain for the reporter gene, and a protein of interest, 
wherein elements of the fusion protein other than the 
protein of interest have no nuclear localization signals. 

Further provided is a chimeric nucleic acid encoding 
a fusion protein, the fusion protein comprising a DNA 
binding domain for a reporter gene, an activation domain 
for the reporter gene, and a protein of interest, wherein 
elements of the fusion protein other than the protein of 
interest have no nuclear localization signals. A vector 
comprising the chimeric nucleic acid is also provided. 

Additionally provided is a nucleic acid molecule 
encoding a "modified" LexA protein which does not have a 
nuclear localization signal. In a presently preferred 
embodiment, the nucleic acid molecule encodes an amino 
acid sequence as shown in SEQ ID NO : 2 . SEQ ID NO : 2 
represents the amino acid sequence of the naturally- 
occurring LexA protein but with substitutions R157G and 
K159E. These amino acid substitutions prevent the 
nuclear localization signal normally present in the LexA 
protein from functioning properly. Therefore, the 
"modified" LexA protein having amino acid SEQ ID NO : 2 has 
no nuclear localization signal (i.e. no functional 
nuclear localization signal) and cannot enter the nucleus 
on its own. In one preferred embodiment, the nucleic 
acid molecule encoding the "modified" LexA protein has a 
nucleotide sequence as shown in SEQ ID N0:1. SEQ ID NO : 1 
represents the nucleotide sequence of the naturally- 
occurring LexA protein but with the codons for amino acid 
residues 157 and 159 changed from CGC and AAA to GGC and 
GAA, respectively. These nucleotide substitutions alter 
the amino acid sequence of the LexA protein such that the 
nuclear localization signal normally present in the LexA 
protein does not function properly. 



The invention further provides a "modified" LexA 
protein (mutated or modified from its naturally occurring 
amino acid and/or nucleotide sequence) , wherein the 
modified LexA protein has no nuclear localization signal 
5 but maintains its ability to bind promoter elements. As 
discussed above, in a presently preferred embodiment the 
"modified" LexA protein has an amino acid sequence as 
shown in SEQ ID NO : 2 . 

Also provided is a method of determining the 

10 presence of a nuclear export signal in a protein of 

interest. The method comprises: selecting host cells 
for use in the method, wherein each of the host cells 
contain a nucleus having nucleic acid encoding a reporter 
gene therein; identifying a DNA binding domain and an 

15 activation domain for the reporter gene; constructing a 
chimeric nucleic acid encoding a fusion protein 
comprising the DNA binding domain, the activation domain, 
and a nuclear localization signal, wherein elements of 
the fusion protein have no nuclear export signals; 

2 0 introducing the chimeric nucleic acid into one of the 

host cells; determining a first level of expression of 
the reporter gene; constructing a second chimeric nucleic 
acid encoding a second fusion protein comprising the DNA 
binding domain, the activation domain, the nuclear 
25 localization signal, and a protein of interest; 

introducing the second chimeric nucleic acid into another 
one of the host cells; and determining a second level of 
expression of the reporter gene to determine the presence 
of a nuclear export signal in the protein of interest. 

3 0 The invention further provides a recombinant host 

cell comprising: a nucleus having nucleic acid encoding a 
reporter gene therein; and a chimeric nucleic acid 
encoding a fusion protein, the fusion protein comprising 
a DNA binding domain for the reporter gene, an activation 



- 12 - 



domain for the reporter gene, and a nuclear localization 
signal, wherein elements of the fusion protein have no 
nuclear export signals. 

Further provided is a chimeric nucleic acid encoding 
5 a fusion protein, the fusion protein comprising a DNA 
binding domain for a reporter gene, an activation domain 
for the reporter gene, and a nuclear localization signal, 
wherein elements of the fusion protein have no nuclear 
export signals. A vector comprising the chimeric nucleic 
10 acid molecule, as well as a kit comprising the vector, 
are also provided. 

As used herein, "naturally occurring" as applied to 
an object refers to the fact that the object can be found 
in nature. For example, a protein that is present in an 
15 organism that can be isolated from that organism and 

which has not been intentionally modified by man in the 
laboratory is "naturally occurring" . 

As further used herein, a "nuclear localization 
signal" refers to an intrinsic signal in a protein or 

2 0 molecule that mediates active transport of the protein or 

molecule across nuclear pore complexes into the nucleus. 
As further used herein, a "nuclear export signal" refers 
to an intrinsic signal in a protein or molecule that 
mediates active transport of the protein or molecule 
25 across nuclear pore complexes out of the nucleus. 

A "protein of interest" is intended to refer to any 
protein for which one wishes to determine whether such 
protein has a nuclear localization signal and/or a 
nuclear export signal. 

3 0 With this general understanding of the terms used 

herein, the invention in regard to nuclear import 
provides an expression vector comprising a chimeric 
nucleic acid molecule (the chimeric nucleic acid molecule 
is described above as encoding a fusion protein, the 



fusion protein comprising a DNA binding domain for a 
reporter gene, an activation domain for the reporter 
gene, and a protein of interest, wherein elements of the 
fusion protein other than the protein of interest have no 
5 nuclear localization signal) . In a presently preferred 
embodiment, the expression vector is a yeast one-hybrid 
expression vector, designated pLG, which was designed to 
conveniently and rapidly assay the ability of proteins to 
enter the cell nucleus. pLG expresses a triple-fusion 

10 protein comprising a modified bacterial LexA (the DNA 
binding domain) , yeast Gal4 activation domain, and the 
tested protein encoded by a cDNA subcloned in- frame into 
the multiple cloning site downstream of Gal4 activation 
domain open reading frame (ORF) (Fig. 10) . 

15 When this expression vector is introduced into a 

host cell (having a nucleus having nucleic acid encoding 
the reporter gene therein) , if the tested protein 
contains a functional nuclear localization signal (NLS) , 
the fusion protein will enter the host cell nucleus. A 

20 presently preferred host cell for use with the pLG vector 
is the yeast L4 0 host cell, which contains a LacZ gene 
and a HIS3 gene. Following nuclear import (if the 
protein of interest includes a nuclear localization 
signal) , the LexA domain targets the fusion protein to 

25 the LexA operator sites of the reporter lacZ gene 

contained in the L4 0 yeast strain. The Gal4 activation 
domain then activates the expression of lacZ resulting in 
|3-galactosidase activity. In the absence of NLS, the 
fusion construct is unable to reach the cell nucleus and, 

3 0 thus, is unable to activate the reporter gene. Indeed, 
expression of pLG carrying a cDNA for a non-nuclear 
protein does not produce any detectable [3 -galactosidase 
activity (Figs. 11 and 18). 
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In addition to induction of the (3-galactosidase 
reporter, this one-hybrid system allows one to directly 
select for the nuclear import of the tested protein in 
the same L4 0 yeast strain, which contains an integrated 
5 copy of the HIS3 gene with upstream LexA operators. 

Similarly to the p-galactosidase expression, only cells 
expressing the NLS- containing fusion protein are able to 
grow on a histidine-def icient medium (Figs. 12 and 19) . 

It should be apparent that because nuclear transport 

10 machinery is generally well conserved between different 
organisms (Nigg 1997) , the import and export signals 
identified in accordance with the subject invention will 
likely be active in other eukaryotic cell types (and 
therefore host cells other than yeast can be used in the 

15 methods and compositions of the subject invention) . 

A component of the pLG vector is a modified LexA 
gene (see above discussion) . Clearly, the success of the 
nuclear import assay using this vector hinges on the 
inability of LexA :: Gal4 :: tested protein fusions to enter 

20 the cell nucleus in the absence of an NLS. Thus, neither 
LexA nor Gal4 should contain NLS sequences. Indeed, the 
Gal4 activation domain is known to lack NLS whereas LexA, 
a bacterial protein, was generally thought not to have 
evolved such a signal. However, the subject invention 

25 relies on the discovery that wild-type LexA carries a 

previously unidentified NLS sequence, rendering the above 
described experimental design impossible. To circumvent 
this difficulty, the LexA NLS was identified and 
inactivated by specific substitutions of two amino acid 

30 residues. This modification of LexA, or another 

modification to inactivate the NLS, is critical if the 
vector is constructed with LexA as the DNA binding 
domain. However, other DNA binding domains could be 
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chosen that do not contain a nuclear localization signal 
to begin with. 

pLG and L4 0 can be provided as a kit for simple and 
rapid functional assay of nuclear import. In addition, 
5 the kit should contain a positive control for nuclear 
import. A presently preferred positive control is a pLG 
derivative containing the SV4 0 NLS sequence at the 
LexA:Gal4 junction. Fusion proteins produced from this 
construct always localize to the nucleus, resulting in 

10 lacZ expression and cell growth in the absence of 
histidine. This control construct, therefore, is 
designed to demonstrate the functionality of the assay as 
well as the active conformation of the fusion protein. 
It should be readily apparent to one of ordinary 

15 skill in the art that various elements of the expression 
vector and the selection of a particular host cell in 
which to conduct the assay for protein nuclear import can 
be varied. For example, in the plasmid pLG the fusion 
protein is under regulatory control of the ADH1 promoter 

20 (see Fig. 10) . The selection of a strong promoter to 

control expression of the fusion protein in the host cell 
is beneficial to distinguish expression of the fusion 
protein (and therefore nuclear import of the fusion 
protein) in the event that the reporter gene is otherwise 

25 activated within the host cell. A strong constitutive 
promoter such as ADH1 can be used, or a strong inducible 
promoter such as the GAL promoter may be used. In either 
case, the host cell can be expressing a first level of 
reporter gene product (for example, lacZ detectable by (3- 

30 galactosidase) before introduction of the expression 
vector comprising the protein to be tested for nuclear 
import. Before the expression vector is introduced into 
the host cell in the case of a constitutive promoter, or 
after the expression vector is introduced into the host 



cell in the case of an inducible promoter, the first 
level can be determined (for the inducible promoter, the 
first level is after introduction but before induction) . 
An increase in expression of the reporter gene product 
5 after introduction and/or induction indicates that the 
fusion protein entered the host cell nucleus. If the 
expression of the reporter gene cannot be quantitated to 
reveal whether the level of expression of the gene has 
increased, then the reporter gene must only be activated 

10 by the fusion protein construct of the subject invention 
and not by any other elements within the host cell . In 
this case, expression of the reporter gene in a 
qualitative sense indicates the presence of a nuclear 
localization signal in the tested protein. In the event 

15 that the DNA binding domain and the activation domain 
"down" regulate the reporter gene, decreased levels of 
expression would be screened for. 

This could be the case where the DNA binding domain 
and the activation domain indirectly affect expression of 

2 0 the reporter gene, such as through a relay gene that 

represses expression of the reporter gene. This concept 
is discussed more fully in U.S. Patent No. 5,525,490, the 
contents of which are incorporated herein by reference. 
As an example, the DNA binding domain and activation 
25 domain could turn on the Gal80 gene which then represses 
Gal4 and therefore HIS3 or lacZ. As another example, 
consider the lac operon which includes lacZ. CAP induces 
expression of the lac operon and therefore lacZ, while 
the lac repressor represses expression of the lac operon 

3 0 and therefore lacZ. Positive and negative regulation of 

the reporter gene of the subject invention, and direct 
and indirect (such as through Gal8 0) regulation of the 
reporter gene are specifically intended to be covered 
herein by the language regarding a DNA binding domain and 
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an activation domain for a reporter gene. One thus 
compares the first and second levels of expression of the 
reporter gene to determine whether a nuclear import 
signal is present. If the nuclear binding domain and 
5 activation domain lead, directly or indirectly, to up- 
regulation of the reporter gene, then an increase in 
reporter gene expression signifies a nuclear localization 
signal. If the nuclear binding domain and activation 
domain lead, directly or indirectly, to down- regulation 

10 of the reporter gene, then a decrease in reporter gene 
expression signifies a nuclear localization signal. 

Examples of suitable alternative constitutive 
promoters include yeast promoters (such as PGK, GAP, and 
TPI) and mammalian promoters (such as CMV) , and examples 

15 of suitable alternative inducible promoters include yeast 
promoters (GAL1, GAL10, methionine) and mammalian 
promoters (glucocorticoid inducible promoter and 
estradiol inducible promoter) . 

The DNA binding domain and the activation domain for 

20 the chosen reporter gene can also be varied, as can the 
chosen reporter gene . The key is that the DNA binding 
domain recognizes and the gene activation domain 
activates, directly or indirectly, the same reporter 
gene) . Importantly, the selection of a DNA binding 

25 domain and activation domain must ensure that they do not 
contain functional NLS sequences. As with LexA, if a DNA 
binding domain or activation domain is chosen which 
includes an NLS, the NLS can be modified to eliminate the 
nuclear localization signal function. 

3 0 The combination of these two structural domains (the 

DNA binding domain and the activation domain) is 
generally referred to as a transcription activator. 
Transcription activators are proteins that generally 
positively regulate the expression of specific genes. As 
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indicated, they can be functionally dissected into two 
structural domains: one region that binds to specific 
DNA sequences and thereby confers specificity, and 
another region termed the activation domain that binds to 
5 protein components of the basal gene expression machinery 
(Ma and Ptashne (198 8) ) . These two domains need to be 
physically connected in order to function as a 
transcriptional activator. The host cell is chosen such 
that the transcriptional activator drives the expression 

10 of a specific reporter gene (such as HIS3 or lacZ) , which 
provides the read-out for the nuclear import. 

Examples of transcription activators include GAL4 
and VP16, and examples of reporter genes include lacZ, 
CAT, luciferase, and GFP. The knowledge of two 

15 structural domains of transcription activators, and the 
knowledge that each domain must be present to activate a 
gene, has been utilized in yeast two-hybrid 
methodologies. For discussions of yeast two hybrid 
procedures generally, see Fields and Song (1989) ; Chein 

20 et al . (1991); Silver and Hunt (1993); Durfee et al . 

(1993); Yang et al . (1992); Luban et al . (1993); Hardy et 
al. (1992); Bartel et al . (1993); Vojtek et al . (1993); 
Li and Fields (1993); Lalo et al . (1993); Jackson et al . 
(1993); Madura et al . (1993); Bardwell et al . (1993); 

25 Chakraborty et al . (1992); Straudinger et al . (1993); 

Milne and Weaver (1993); Iwabuchi et al . (1993); Bogerd 
et al . (1993); Dasmahapatra et al . (1992); Germino et al . 
(1993) ; and Guarente (1993) . 

In the Examples which follow, the yeast host cell 

3 0 has two built-in reporters. The first reporter is the 
(3-galactosidase enzyme. It is induced only after the 
tested protein-containing fusion product enters the cell 
nucleus, resulting in strong blue color of the yeast 
colonies. In addition, nuclear import of the fusion 
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protein induces an auxotrophic marker HIS3 , resulting in 
the ability of the yeast cells to grow on a histidine- 
deficient medium. Since histidine selection is known to 
be slightly leaky, the best results are achieved by 
5 including 3 -amino- 1 , 2 , 4 , -triazole in the growth medium. 
The Examples which follow also utilize the yeast strain 
L40 as the host cell. This strain is the most suitable 
for the exemplified assay because it contains both 
reporter genes under inducible promoters activated by the 

10 LexA- GAL4 AD fusion. This strain also is unable to grow 
in the absence of tryptophan and/or histidine, allowing 
for selective growth of cells containing the assay 
plasmid (TRP 1 marker) and/or induced reporter gene (HIS 
3 marker). If the assay cassette, i.e. the fusion gene 

15 mLexA : : GAL4AD { - ) NLS , is transferred to another vector 

with a different auxotrophic growth marker (e.g., LEU or 
URA) , the host strain has to be modified accordingly to 
allow the selective growth of the new plasmid. 

The reporter gene(s) can be substituted with other 

2 0 reporters. For example, (3-galactosidase can be exchanged 

with green fluorescent protein (GFP) and HIS3 replaced 
with URA3 . Although GFP detection does not require 
specific staining used for the (3-galactosidase assay, the 
latter may be more easily and accurately quantified. 
25 This quantification may be useful for comparisons of NLS 
activities between different proteins of interest or 
where the host cell has a first level of expression of 
the reporter gene . 

The primary requirements in accordance with the 

3 0 subject invention are that a host cell be chosen which 

contains a reporter gene therein (located in the nucleus 
of the host cell) , and that a DNA binding domain and an 
activation domain that interact with and activate that 
reporter gene, directly or indirectly, be chosen. Again, 
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the DNA binding domain and the activation domain should 
be chosen such that neither of them contain a nuclear 
localization signal, which would lead to nuclear import 
of the fusion protein even if the protein of interest did 
5 not contain a nuclear localization signal (a false 

positive result) . It should be readily apparent that a 
particular host cell could be recombinantly constructed 
to contain a desired reporter gene for use in the method 
of the invention. 

10 The appropriate screening method for expression of 

the reporter gene depends upon the reporter gene chosen. 
For example, an assay for (3-galactosidase is used to 
detect expression of the lacZ gene. Growth on a 
particular medium {i.e. a histidine deficient medium) can 

15 be used to detect expression of the HIS3 gene (referred 
to herein as a "selection marker" reporter gene) . Such 
reporter genes and their appropriate screening methods 
are known in the art . 

The methods and compositions of the subject 

20 invention require the construction of chimeric nucleic 
acid molecules and the introduction of such nucleic acid 
molecules into host cells. Routine techniques known in 
the art can be used to accomplish both of these tasks. 
In regard to the construction of chimeric nucleic acid 

25 molecules, the methods of Sambrook et al . (1989) are 
readily applicable. 

In regard to introduction of such nucleic acid 
molecules into host cells, methods known in the art for 
introducing nucleic acid molecules into cells include 

3 0 lithium acetate transformation, and include 

microinjection (in which DNA is injected directly into 
the cytoplasm of cells through fine glass needles) . 
Alternatively, DNA can be incubated with an inert 
carbohydrate polymer (dextran) to which a positively 
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charged chemical group (DEAE, for diethylaminoethyl ) has 
been coupled. The DNA sticks to the DEAE-dextran via its 
negatively charged phosphate groups. These large DNA- 
containing particles stick in turn to the surfaces of 
5 cells, which are thought to take them in by a process 
known as endocytosis. In another method, cells 
efficiently take in DNA in the form of a precipitate with 
calcium phosphate. In electroporation, cells are placed 
in a solution containing DNA and subjected to a brief 

10 electrical pulse that causes holes to open transiently in 
their membranes. DNA enters through the holes directly 
into the cytoplasm, bypassing the endocytotic vesicles 
through which they pass in the DEAE-dextran and calcium 
phosphate procedures. DNA can also be incorporated into 

15 artificial lipid vesicles, liposomes, which fuse with the 
cell membrane, delivering their contents directly into 
the cytoplasm. In an even more direct approach, DNA is 
absorbed to the surface of tungsten microproj ectiles and 
fired into cells with a device resembling a shotgun. 

20 Viral vectors could also be used to introduce 

nucleic acid into host cells. Baculovirus is regularly 
used to introduce nucleic acid into insect cells. 
Viruses of mammalians cells, such as retrovirus, vaccinia 
virus, adenovirus, and adeno-associated virus (AAV) , to 

2 5 name a few, can be used to introduce nucleic acid into 

mammalian host cells. 

In addition to the method and other aspects of the 
invention described above, the subject invention provides 
a nucleic acid molecule encoding a modified LexA protein, 

3 0 wherein the modified LexA protein has no nuclear 

localization signal. The invention further provides a 
modified LexA protein, wherein the modified LexA protein 
has no nuclear localization signal. As indicated above, 
the nucleic acid molecule and the protein represent a 
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modification of LexA which abolishes its intrinsic NLS 
activity but preserves its ability to bind promoter 
elements. Two specific amino acid changes in the LexA 
primary sequence were made, but, in principle, it is 
5 possible to alter other amino acids in LexA to achieve 
the same objectives, i.e. block the NLS function but 
retain the specific DNA binding to the LexA operators of 
the promoter. The modification of LexA uncouples nuclear 
import and promoter binding activities of LexA. 

10 Much of the above discussion is equally applicable 

to the invention in regard to nuclear export. In this 
regard, the invention provides an expression vector 
comprising a chimeric nucleic acid molecule (which 
encodes a fusion protein, the fusion protein comprising a 

15 DNA binding domain for a reporter gene, an activation 

domain for the reporter gene, and a nuclear localization 
signal, wherein elements of the fusion protein have no 
nuclear export signal) . In a presently preferred 
embodiment, the expression vector is a yeast one-hybrid 

2 0 expression vector, designated pNEA, which was designed to 
conveniently and rapidly assay the ability of proteins to 
exit the cell nucleus. pNEA expresses a fusion protein 
comprising a modified bacterial LexA (the DNA binding 
domain) , yeast Gal4 activation domain, the SV40 large T- 

2 5 antigen NLS, and the tested protein encoded by a cDNA 

subcloned in- frame into the multiple cloning site 
downstream of Gal4 activation domain open reading frame 
(Fig. 16) . When this expression vector is introduced 
into a host cell (having a nucleus having nucleic acid 

3 0 encoding the reporter gene therein) , the fusion protein 

should enter the host cell nucleus due to the SV4 0 large 
T-antigen NLS. If the tested protein contains a 
functional nuclear export signal strong enough to 
override the NLS, the fusion protein will not enter the 
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host cell nucleus and neither the LacZ gene nor the HIS3 
gene will be activated. 

As discussed above in regard to nuclear import, 
various host cells and elements of the expression vector 
5 can be selected for use in the assay for protein nuclear 
export. This includes variations in the promoter, DNA 
binding domain, activation domain, and reporter gene, as 
well as the NLS (replacing the SV40 large T-antigen NLS) . 
Likewise, the concept of direct and indirect affects 

10 on the expression of the reporter gene, and up- and down- 
regulation of the reporter gene, are equally applicable 
to the nuclear export aspect of the subject invention. 
If the tested protein includes a nuclear export signal, 
repression through a relay gene would then result in 

15 decreased export and therefore increased reporter gene 
expression (see above discussion in regard to nuclear 
import) . 

MATERIALS AND METHODS 

2 0 Yeast and Growth Conditions. Yeast cultures were 

grown and maintained in yeast extract/peptone/ dextrose or 
the appropriate selective minimal medium using standard 
conditions (Kaiser et al . 1994). Saccharomyces 
cerevisiae strain L40 (MATa his3A200 trpl-901 leu2-3,112 
25 ade2 lys2-801am URA3 : : (lexAop) 8 -lacZ LYS2 : : ( lexAop) 4 -HIS3) 
was used in all experiments (Hollenberg et al . 1995) . 
For selective growth in the absence of histidine, the 
medium was supplemented, if necessary, with 
3-amino-l, 2 , 4-triazole (3 AT) as specified for each 

3 0 specific experiment. Plasmids were introduced into yeast 

cells using the standard lithium acetate protocol (Kaiser 
et al . 1994) . 

DNA Constructions (see also Examples I and III) . 
For pNIA and its fusion constructs, the Gal4 activation 
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domain (Gal4AD) , derived from the plasmid pGAD424, was 
PCR-amplif ied without the adjacent SV4 0 large T-antigen 
NLS. During amplification, EcoRI and BamHI restriction 
sites were introduced at the 5' and 3' ends of the 
5 amplified fragment, respectively. Then, a wild- type lexA 
gene in the vector pBTMUS (Hollenberg et al . 1995) was 
ligated in- frame with Gal 4 AD following restriction 
digestion of the corresponding purified PCR fragments 
with EcoRI and BamHI using standard molecular biology 

10 protocols (Ausubel et al . 1987). The resulting fusion 
construct was designated pLG. Next, the BamHI fragment 
of the VirE2 ORF from pET3b-VirE2 (Citovsky et al . 1988) 
was subcloned in-frame into the BamHI site of pLG, 
placing it immediately downstream of Gal4AD to produce 

15 pLGE2 . Two amino acid residues of LexA within pLGE2 were 
mutated to produce the substitutions R157G and K15 9E by 
changing their codons CGC to GGC and AAA to GAA, 
respectively, using oligonucleotide directed mutagenesis 
with the Transformer™ Site-Directed Mutagenesis Kit 

2 0 (Clontech Laboratories, Inc.) according to the 

manufacturer's protocol. This procedure converted pLGE2 
to pNIAE2 . To produce pNIAD2 , the VirE2 ORF in pNIAE2 
was replaced with the BamHI fragment of VirD2 ORF from 
pGBTD2 (Ballas and Citovsky 1997) . 

2 5 For construction of pNEA, the same approach was 

employed except that the 5' primer for PCR amplification 
of Gal4AD included the sequence for the SV4 0 large 
T-antigen NLS, placing it at the amino terminus of 
Gal 4 AD . For pNEARev, the Rev ORF derived from pDM121 

30 (Dr. D. McDonald, Salk Institute) was PCR-amplif ied, 

introducing Bglll restriction sites at both ends of the 
fragment and ligated into the BamHI site of pNEA. The 
M10 mutant of Rev was PCR-amplif ied from pMlO (Malim et 
al . 1989) introducing Smal and PstI restriction sites at 
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5' and 3' ends of the amplified fragment, respectively, 
and subcloned into the Smal and PstI sites of pNEA to 
produce pNEAMlO . Similarly, RevA3 was PCR-amplif ied from 
pDM121A3NI (Dr. D. McDonald, Salk Institute) using the 5' 
5 and 3' primers containing Smal and PstI recognition 

sequences, respectively, and subcloned into the Smal and 
PstI sites of pNEA, resulting in the pNEARevA3 construct. 

To obtain pNEACP, the BamHI and PstI sites were 
introduced at the 5' and 3' ends, respectively, of the 

10 tomato yellow leaf curl virus (TYLCV) CP ORF amplified by 
PCR from pTYH2 0 which contains the full length viral 
genome (Navot et al . 1991). The resulting fragment was 
ligated into the BamHI and PstI sites of pNEA. To 
generate pNEACPAM, pNEACP was digested with Styl and 

15 Clal, purified, treated with the Klenow fragment of the 
E . coli DNA polymerase, and self -ligated, preserving the 
correct reading frame. Finally, for pNEACPAC, pNEACP was 
digested with PstI and Clal, purified, sequentially 
treated with T4 DNA polymerase and the Klenow fragment of 

20 the E. coli DNA polymerase, and self -ligated. 

Pfu polymerase (Stratagene) was used for all PCR 
reactions according to manufacturer's instructions. All 
mutations and ligation junctions were confirmed by DNA 
sequencing . 

25 Analytical Methods. For quantitative determination 

of the (3-galactosidase activity, the enzymatic assay was 
performed in liquid as described (Stachel et al . 1985). 
Qualitatively, p 5 -galactosidase was assayed on 
nitrocellulose filters as described (Hollenberg et al . 

30 1995) . 

For quantitation of growth, yeast cells were grown 
in tryptophan dropout minimal medium, harvested, and 
diluted to the optical density A 600 = 0.5. Serial 5-fold 
dilutions of the resulting cultures were prepared and 5 
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/ul of each dilution was spotted, onto selective medium 
plates lacking both tryptophan and histidine. As a 
control, the same amount of each dilution was spotted 
onto the minimal medium lacking only tryptophan. 

5 

EXAMPLE I 

Construction of vectors for the genetic assay of nuclear 
import 

These constructs are designed to express fusion 
10 proteins composed of three functional parts: a modified 
LexA protein, an activation domain of the GAL4 protein, 
and a protein to be tested for its nuclear import. These 
components were obtained and joined together as follows: 
(A) First, the Gal4 activation domain (AD) , derived 
15 from the pGAD424 plasmid (Fig. 1) , was PCR-amplif ied with 
and without the adjacent SV40 NLS (AD with NLS was used 
for positive control constructs, see below) . During 
amplification, EcoRI and BamHI restriction sites were 
introduced at the 5' and 3' ends of the amplified 
2 0 fragment, respectively. The PCR mixtures contained the 
following components: 

(a) PCR of GAL4 AD without NLS 
Primer GAD 5 (2 0 ,uM) 
Primer GAD3BdE (2 0 fiM) 

2 5 dNTPs (10 mM each for dATP, 

dTTP, dGTP, dCTP) 
Pfu reaction buffer (10X) 
Template DNA (pGAD424, 10 ng//xl) 
Pfu polymerase (0.5 p./ \xl) 

3 0 Double distilled water 



5 pi 

5 M l 

2 pi 

10 pi 

5 pi 

1 pi 

72 pi 

100 al 
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Primer GAD5 : SEQ ID NO : 8 : 

5 1 -GGGAA TTCAA TTTTA ATCAA AGTGG G-3 ' 
Primer GAD3BdE : SEQ ID NO : 9 : 

5'-GACGG ATCCC CGGGT ATTCG ATCTC TT-3' 
5 (b) PCR of GAL 4 AD with NLS 

Primer GAD 5 NLS (2 0 xiM) 5 /xl 

Primer GAD3BdE (2 0 /xM) 5 xxl 

dNTPs (10 mM each for dATP , 

dTTP , dGTP , dCTP) 2 /xl 

10 Pfu reaction buffer (10X) 10 /xl 

Template DNA (pGAD424, 10 ng/ttl) 5 /xl 

Pfu polymerase (0.5 tt/ til) 1 /xl 

Double distilled water 72 /xl 

TOTAL 100 /xl 

15 Primer GAD5NLS : SEQ ID NO: 10: 

5' -GGGAA TTCGA TAAAG CGGAA TTAAT TCCC-3' 
Primer GAD3BdE : SEQ ID NO: 11: 

S'-GACGG ATCCC CGGGT ATTCG ATCTC TT-3' 
PCR conditions for all reactions: 
20 94°C / 2 min. 1 cycle 

94°C / 45 sec: 45°C / 45 sec: 72°C / 2 min. 35 cycles 
72°C/10 min. 1 cycle 

(B) Then, wild-type LexA in the pBTM 116 vector 
(Fig. 2) was joined in-frame with Gal4 AD following 

25 restriction digestion of the corresponding purified PCR 
fragments with EcoRI and BamHI using standard molecular 
biology protocols. The resulting fusion constructs were 
designated pLexA : : GAL4AD (-)NLS and pLexA : : GAL4AD (+)NLS. 

(C) Next, two testing genes were introduced into 

3 0 pLexA : : GAL4AD ( - ) NLS and pLexA : : GAL4AD (+)NLS vectors for 
NLS-negative and NLS-positive controls. 

(a) NLS-negative protein, VirE2 of Agrobacterium 
tumefaciens . VirE2 is known to remain cytoplasmic when 
expressed in yeast and animal cells, making it a suitable 



negative control for the nuclear import assay. The BamHI 
fragment of the pEE2 plasmid (Fig. 3) containing the 
VirE2 ORF was subcloned in- frame into the BamHI site of 
the pLexA: : GAL 4 AD ( - ) NLS and pLexA : : GAL4AD (+)NLS 
5 vectors, placing it immediately downstream of GAL4 AD. 

(b) NLS-positive protein, VirD2 of Agrobacterium 
tumefaciens . VirD2 is known to accumulate in the cell 
nucleus when expressed in yeast and animal cells, making 
it a suitable positive control for the nuclear import 

10 assay. The BamHI fragment of the pED2 plasmid (Fig. 4) 
containing the VirD2 ORF was subcloned in- frame into the 
BamHI site of pLexA : : GAL4AD ( - ) NLS vector, placing it 
immediately downstream of Gal4 AD. 

(D) Finally, the LexA gene in the above described 

15 constructs was modified to remove its part that encodes a 
functional nuclear localization sequence (NLS) which was 
identified by amino acid sequence analysis of LexA. This 
was performed by site directed mutagenesis using a 
TRANSFORMER™ Site-Directed Mutagenesis Kit (Cat.# 

2 0 K1600-1) from CLONTECH Laboratories, Inc. according to 

the manufacturer's protocol. Specifically, two amino 
acids in the LexA protein were mutated to produce 
substitutions R157G and K159E by changing their codons 
CGC to GGC and AAA to GAA, respectively (resulting in the 
25 nucleotide sequence shown in SEQ ID NO : 2 , encoding the 

amino acid sequence shown in SEQ ID NO:l). The sequences 
for the mutagenesis primers were: 

Mutant primer [designated LexA ( -NLS ) ] : SEQ ID NO:12: 
5'-CCGTT AAGGG CCTGG AAAAA CAGGG-3 ' 

3 0 Selection primer (designated Seal -to-StuI ) : SEQ ID 

NO: 13 : 

5'-GTGAC TGGTG AGGCC TCAAC CAAGT C-3 ' 



This procedure produced a modified LexA which was 
designated mLexA. Collectively, the above described 
procedures yielded the following five constructs: 

1. pmLexA : : GAL4AD (-)NLS (Fig. 5) 

assay vector, the experimental construct in which 
the gene of interest should be subcloned in- frame 

2. pmLexA: : GAL 4 AD ( - ) NLS : : VirE2 (Fig. 6) 

negative import control for the assay 

3. pmLexA: : GAL4AD ( - ) NLS : : VirD2 (Fig. 7) 

positive import control for the assay 

4. pmLexA: : GAL4AD (+) NLS : : VirE2 (Fig. 8) 

positive control for the ability of the experimental 
construct to produce fusion protein capable of 
nuclear import, i.e. that the protein of interest 
does not non-specif ically alter the conformation of 
the fusion protein, preventing nuclear import even 
in the presence of an active NLS 

5. pmLexA: : GAL4AD ( - ) NLS : : 2DriV (Fig. 9) 

another negative control containing ant i sense 

orientation of VirD2 
All these plasmids are Amp r and TRP1, requiring growth on 
an ampic ill in- containing medium in E. coll and on a 
tryptophan drop-out medium in yeast cells. 



2 5 EXAMPLE II 

One-hybrid genetic assay for protein nuclear import 

The fusion protein derived from pLexA: :GAL4AD 
( - ) NLS : : VirE2 enters the cell nucleus and activates the 
reporter gene expression, indicating that LexA carries a 

3 0 cryptic NLS although it is a prokaryotic protein and is 

not expected to enter the nucleus. Thus it was necessary 
to identify and disable this signal, resulting in a 
modification of the LexA protein (see item D above) . 
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The fusion protein derived from pmLexA: : GAL4AD 
(-)NLS, which lacks a tested protein, enters the cell 
nucleus by diffusion due to its small size (approximately 
3 8 kDa) . This import, however, is less efficient than 
5 that of NLS-containing fusion products, resulting in a 
weaker expression of the (3-galactosidase reporter. 
Nevertheless, it is recommended to use pmLexA: : GAL4AD 
( - ) NLS : : VirE2 as a negative control for the assay. This 
106 kDa fusion protein does not enter the nucleus at all, 
10 producing zero expression of the reporter. 

The user has an option of constructing his/her own 
custom made negative control for this assay by subcloning 
the protein of interest in antisense orientation. 
Results indicate that VirD2, which targets the fusion 
15 product derived from pmLexA: : GAL4AD ( - ) NLS : : VirD2 to the 
nucleus, does not promote nuclear import when subcloned 
into the same vector in antisense orientation, i.e. 
pmLexA: : GAL4AD ( - ) NLS : : 2DriV . 

The current version of pmLexA: : GAL 4 AD (-)NLS 
2 0 includes only four unique cloning sites, Smal, BamHI , 
Sail and PstI, for insertion of the gene of interest. 
However, additional sites can be easily engineered, if 
required, using simple standard cloning techniques. 

Once the gene of interest is inserted in- frame into 
25 the pmLexA: : GAL4AD (-)NLS assay vector, it can be 

transformed into the L4 0 yeast strain (MAT a his3A2 0 0 
trpl-90 leu2-3,112 ade2 , lys2 : : LYS2 : : LexAHIS3 , 
ura3 : :URA3 : : LexA lacZ gal80) by any standard procedure 
using either Lithium acetate or electroporat ion (Ausubel 
30 et al . 1987) . For negative and positive controls, the 
appropriate constructs (described above) are separately 
introduced into L40 cells. 

The resulting yeast strains are grown on a selective 
medium and assayed for [3-galactosidase activity after one 



or two days of growth using standard procedures. A 
positive result, i.e. dark blue-stained yeast colonies, 
indicates active nuclear import of the fusion protein 
and, consequently, the presence of a functional NLS in 
5 the tested protein. 

EXAMPLE III 

Nuclear import assay. The basic strategy of these 
experiments is based on expression in yeast cells of a 

10 triple-fusion protein comprising bacterial LexA, yeast 
Gal4 activation domain (Gal4AD) , and the tested protein 
encoded by a cDNA subcloned in-frame downstream of Gal4AD 
(Fig. 16) . If the tested protein contains a functional 
NLS, the fusion product will enter the yeast cell 

15 nucleus. Following this nuclear import, the LexA domain 
will target the fusion protein to the LexA operator sites 
of the reporter lacZ gene contained in the L4 0 yeast 
strain. Gal4AD then activates the expression of lacZ, 
resulting in S-galactosidase activity. In the absence of 

2 0 a NLS, the fusion protein is unable to reach the cell 
nucleus and, thus, activate the reporter gene. 

In addition to induction of the S-galactosidase 
reporter, this one-hybrid system allows direct selection 
for the nuclear import of the tested protein in the same 

25 L40 yeast strain, which contains an integrated copy of 
the HIS3 gene with upstream LexA operators. Only cells 
expressing the NLS -containing fusion protein will grow on 
a histidine-def icient medium. 

Clearly, the success of this approach hinges on the 

30 inability of LexA-Gal4AD-tested protein fusions to enter 
the cell nucleus in the absence of an NLS contained 
within the tested protein. Thus, neither LexA nor Gal4AD 
should contain NLS sequences. While Gal4AD is known to 
lack NLS (Silver et al . 1988), LexA, a bacterial protein, 
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is generally thought not to have evolved such a signal. 
Surprisingly, however, the studies herein demonstrated 
that a LexA- Gal 4AD fusion containing VirE2, an 
Agrobacterium protein shown to lack an NLS functional in 
5 animal cells (Guralnick et al . 1996), induced the 

S-galactosidase reporter (Fig. 18, pLGE2 construct) and 
grew on a histidine dropout medium (Fig. 19, pLGE2 
construct) . 

VirE2 is a large protein (70 kDa) ; thus, the 
10 LexA-Gal4AD- VirE2 fusion is likely to be actively 
imported into the cell nucleus to allow this gene 
induction. Because the absence of NLS in LexA was 
implied from its bacterial origin rather than 
demonstrated directly, it is possible that LexA carries a 
15 previously unidentified NLS signal. Inspection of the 

amino acid sequence of LexA identified a short stretch of 
basic residues (Fig. 17) which may function as an NLS. 
Two amino acid substitutions, R157G and K159E, were made 
in this motif (Fig. 17) , resulting in a modified LexA 
20 (mLexA) within the triple- fusion expression vector, 

designated pNIA (Nuclear Import Assay) . Figs. 18 and 19 
show that mLexA expressed in fusion with Gal4AD and VirE2 
from the pNIAE2 construction no longer activated the 
reporter genes la.cZ (Fig. 18) and HI S3 (Fig. 19) , 

2 5 indicating the lack of nuclear import of the fusion 

protein . 

To exclude a possibility that LexA mutagenesis 
non- specif ically inactivated this protein, a short amino 
acid sequence corresponding to the SV40 large T-antigen 

3 0 NLS was introduced between the mLexA and Gal4AD domains 

of pNIAE2 , producing pNIA(+)E2. The fusion protein 
produced from this construct localized to the cell 
nucleus, resulting in la.cZ induction (Fig. 18) and cell 
growth in the absence of histidine (Fig. 19) and 
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indicating mLexA functionality in activation of gene 
expression. Note that the presence of the tested protein 
within the pNIA construction is essential for detection 
of bona fide nuclear import; in the absence of the tested 
5 protein, the mLexA-Gal4AD fusion produced from pNIA alone 
may simply diffuse into the nucleus due to its small size 
(data not shown) . 

Next, pNIA was used to test its ability to detect a 
functional NLS within a known nuclear protein. To this 

10 end, an NLS -containing protein, VirD2 of Agrobacterium 

(Citovsky et al . 1994; Howard et al . 1992), was subcloned 
into pNIA (pNIAD2 construction) . The resulting 
mLexA-Gal4AD-VirD2 fusion protein was imported into the 
cell nucleus as illustrated by activation of the reporter 

15 genes lacZ (Fig. 18) and HIS3 (Fig. 19) . When, as a 

negative control, a VirD2 cDNA sequence was inserted into 
pNIA in the antisense orientation, the fusion product did 
not activate the reporter genes (data not shown) . These 
results demonstrate the pNIA allows detection of and 

2 0 selection for proteins containing a functional NLS 
sequence . 

More particularly, Fig. 16 shows the plasmid 
compositions. pNIA expresses a fusion protein consisting 
of mLexA, Gal 4 AD, and protein to be tested; pNEA produces 

2 5 a fusion between mLexA, SV4 0 NLS, Gal4AD, and a tested 

protein. Asterisk indicates the position of the LexA NLS. 
MCS indicates the multiple cloning sites that include the 
sites for Smal , BamHI , Sail, and PstI restriction 
endonucleases . The plasmid backbone is derived from 

30 pBTM116 (Hollenberg et al . 1995). Fig. 17 shows the LexA 
NLS and amino acids substitutions (asterisks) which 
inactivate this signal, producing modified LexA (mLexA) . 
Numbers indicate the position of the nucleotides (top) 
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and amino acid residues (bottom) within LexA gene and 
protein sequences, respectively. 

Figs. 18 and 19 show the results of the nuclear 
import assay. Fig. 18 shows the S-galactosidase assay 
5 following cell growth on minimal medium without 

tryptophan. Fig. 19 shows the selection assay by cell 
growth on minimal medium deficient for both tryptophan 
and histidine and supplemented with 5 mM of 3AT . pLGE 
expresses VirE2 fused to wild- type LexA and Gal4AD, 
10 pNIAE2 expresses VirE2 fused to modified LexA (mLexA) and 
Gal 4 AD, pNIAE2 expresses VirE2 fused to mLexA, SV4 0 NLS , 
and Gal 4 AD, and pNIAD2 expresses VirD2 fused to mLexA and 
Gal4AD. 



15 EXAMPLE IV 

Construction of vectors for the genetic assay of nuclear 
export 

These constructs are designed to express fusion 
proteins composed of three functional parts: a modified 

2 0 LexA protein, activation domain of the GAL 4 protein, and 

a protein to be tested for its nuclear export. These 
components were obtained and joined together as follows: 
(A) First, the Gal4 activation domain (AD) with the 
adjacent SV4 0 NLS, derived from the pGAD424 plasmid (Fig. 
25 1) , was PCR-amplif ied. During amplification, EcoRI and 
BamHI restriction sites were introduced at the 5' and 3' 
ends of the amplified fragment, respectively. The PCR 
mixtures contained the following components: 

Primer GAD5NLS (2 0 pM) 5 pi 

3 0 Primer GAD3BdE (2 0 pM) 5 pi 

dNTPs (10 mM each for dATP, 

dTTP, dGTP, dCTP) 2 pi 

Pfu reaction buffer (10X) 10 pi 

Template DNA (pGAD424, 10 ng/pl) 5 pi 
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Pfu polymerase (0.5 fi/ fil) 



1 pel 
72 fil 
100 fil 



Double distilled water 



TOTAL 



Primer GAD5NLS : SEQ ID NO: 10: 



5 



5 ' -GGGAA TTCGA TAAAG CGGAA TTAAT 



TCCC-3 ' 



Primer GAD3BdE : SEQ ID NO: 11: 



5 ' -GACGG ATCCC CGGGT ATTCG ATCTC 



TT-3 ' 



PCR conditions for all reactions: 



94°C / 2 min. 



1 cycle 



10 94°C / 45 sec: 45°C / 45 sec: 72°C / 2 min. 35 cycles 



(B) Then, wild-type LexA in the pBTM116 vector (Fig. 
2) was joined in-frame with Gal4 AD following restriction 
digestion of the corresponding purified PCR fragments 

15 with EcoRI and BamHI using standard molecular biology 

protocols. The resulting fusion construct was designated 
pLexA : : GAL4AD ( + ) NLS . 

(C) Next, the LexA gene in pLexA: : GAL4AD (+)NLS was 
modified to remove its part that encodes a functional 

20 nuclear localization sequence (NLS) which had been 

identified by amino acid sequence analysis of LexA. This 
was performed by site directed mutagenesis using 
Transformer™ Site-Directed Mutagenesis Kit (Cat.# 
K1600-1) from CLONTECH Laboratories, Inc. according to 

25 the manufacturer's protocol. Specifically, two amino 
acids in the LexA protein were mutated to produce 
substitutions R157G and K159E by changing their codons 
CGC to GGC and AAA to GAA, respectively. The sequences 
for the mutagenesis primers were: 

3 0 Mutant primer [designated LexA ( -NLS)] : SEQ ID NO: 12: 

5 ' -CCGTT AAGGG CCTGG AAAAA CAGGG-3 ' 

Selection primer (designated Seal - to- StuI ) : SEQ ID 
NO: 13 : 

5'-GTGAC TGGTG AGGCC TCAAC CAAGT C-3 ' 



72°C/10 min. 



1 cycle 



This procedure produced a modified LexA which was 
designated mLexA, resulting in the pmLexA: : GAL4AD (+)NLS 
construct, also designated pNEA (Fig. 13) (nuclear export 
assay) . 

5 (D) Finally, two testing genes were introduced into 

pNEA for NES-negative and NES-positive controls. 

(a) NES-negative protein, VirE2 of Agrobacterium 
tumefaciens . VirE2 , known to lack NES, was used as 
negative control for the nuclear export assay. BamHI 

10 fragment of pEE2 plasmid (Fig. 3) containing the VirE2 
ORF was subcloned in- frame into the BamHI site of pNEA, 
placing it immediately downstream of GAL4 AD. 

(b) NES-positive protein, Rev of HIV type-1 virus. 
Rev is a known nuclear shuttle protein which contains a 

15 leucine-rich NES, making it a suitable positive control 
for the nuclear export assay. Rev cDNA was PCR-amplif ied 
as a Bglll fragment from pDM121 (McDonald et al . (1998)) 
and subcloned in-frame into the BamHI site of pNEA, 
placing it immediately downstream of Gal4 AD. 
20 Collectively, the described above procedures yielded, 

the following three constructs: 
1. pNEA (Fig. 13) 

assay vector, the experimental construct in which 
the gene of interest should be subcloned in- frame 
25 2. pNEA: :VirE2 (Fig. 15) 

negative import control for the assay 
3. pNEA : : Rev (Fig. 14) 

positive import control for the assay 
All these plasmids are Amp r and TRP1, requiring growth on 
3 0 an ampicillin-containing medium in E . coli and on a 
tryptophan drop-out medium in yeast cells. 
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EXAMPLE V 

One -hybrid genetic assay for protein nuclear export 

Once the gene of interest is inserted in- frame into 
the pNEA assay vector, it can be transformed into the L40 
5 yeast strain (MATa his3A200 trpl-90 leu2-3,112 ade2 , 
lys2 : :LYS2 : : lexAHIS3 , ura3 : : URA3 : : lexA lacZ gal8 0) by 
any standard procedure using either lithium acetate or 
electroporation (Ausubel et al . 1987) . For negative and 
positive controls, the appropriate constructs (described 

10 above) are separately introduced into L40 cells. 

The resulting yeast strains are grown on a selective 
medium and assayed for 6-galactosidase activity after one 
or two days of growth using standard procedures . The 
appearance of white yeast colonies indicates active 

15 nuclear export of the fusion protein and, consequently, 
the presence of a functional NES in the tested protein. 

In addition, nuclear import of the fusion product in 
L40 induces an auxotrophic marker HIS3, resulting in the 
ability of the yeast cells to grow on a 

20 histidine-def icient medium. Thus, if cells transformed 
with pNEA carrying the gene of interest are plated first 
on tryptophan- deficient medium to select for the pNEA 
construct and then replica-plated on a 
tryptophan-histidine double dropout medium, nuclear 

2 5 export will be indicated by the appearance of yeast 

colonies that grow in the absence of tryptophan but do 
not grow in the absence of histidine. 

EXAMPLE VI 

3 0 Nuclear export assay. The ability of pNIA to detect 

protein transport into the nucleus can also be utilized 
to assay for a reverse protein traffic, i.e., nuclear 
export. To this end, the SV40 large T-antigen NLS was 
introduced between mLexA and Gal 4 AD of pNIA, resulting in 
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a Nuclear Export Assay plasmid pNEA (Fig. 16) . The 
separate SV4 0 NLS rather than the internal NLS of 
wild-type LexA was chosen to retain the modular 
composition of the vector and utilize the same mLexA 
5 component as in the nuclear import assay, facilitating 
direct comparison of results obtained with the pNIA and 
pNEA constructs. In pNEA, fusion to a protein without an 
NES will result in nuclear import due to the presence of 
the SV40 NLS. Yeast cells harboring this construction 

10 will express S-galactosidase and grow in the absence of 
histidine. Indeed, as mentioned above, subcloning of 
VirE2 into pNEA (same as pNIA(+)E2 construction) resulted 
in a strong g-galactosidase staining (Fig. 18) and 
histidine prototrophy (Fig. 19) . Note that VirE2 in pNIA 

15 did not induce these effects (Figs. 18 and 19). 

Fusion to an NES -containing protein, on the other 
hand, is expected to redirect the protein product into 
the cell cytoplasm, at least partly abolishing the 
S-galactosidase activity and impeding growth without 

20 histidine. This idea was tested using the Rev protein of 
HIV-1 known to carry a functional NES (Ullman et al . 
1997) . Expression of Rev from the pNEA vector 
dramatically decreased S-galactosidase activity to about 
12% of that observed with pNEA alone (Fig. 20) , 

25 suggesting the predominantly cytoplasmic localization of 
the fusion product. Residual levels of la.cZ activity are 
probably due to a small steady- state pool of Rev protein 
within the cell nucleus due to its nuclear shuttling 
activity (Pollard et al . 1998). 

30 That the decrease in la.cZ induction specifically 

depends on the Rev NES was demonstrated by mutating or 
deleting this signal. First, the M10 mutant of Rev 
(Malim et al . 1989) was introduced into pNEA. Fig. 20 
shows that the M10 NES mutation, which substitutes only 



two amino acid residues within NES (Malim et al . 1989), 
restored the S-galactosidase activity to 30% that of the 
maximum, indicating diminished nuclear export of the 
mutant fusion protein as compared to the wild-type Rev. 
5 A deletion mutation of the Rev NES, RevA3 , which removes 
most of the signal sequence (Taagepera et al . 1998), 
increased la.cZ reporter gene induction to 70% to 90% of 
the maximal level (Fig. 20) . 

Changes in the degree of lacZ gene expression caused 

10 by the Rev NES closely paralleled HIS3 expression. 

Serial dilutions of yeast cell cultures plated on the 
histidine dropout selective medium clearly demonstrated a 
dramatic reduction in histidine prototrophy supported by 
the Rev fusion product. This effect was NES-dependent 

15 because both the M10 and RevA3 mutations gradually 

restored growth on the selective medium. (Fig. 21) . In 
the absence of selection, all strains exhibited equal 
growth (Fig. 22) . These results indicate that the degree 
of repression of the lacZ and HIS3 reporter genes and, by 

2 0 implication nuclear export, directly reflects the 

strength of the NES signal, allowing the use of this 
nuclear export assay to give a quantitative indication of 
and select for the activity of NES signals in proteins of 
interest . 

25 More particularly, Figs. 20-22 shows the results of 

the nuclear export assay. Fig. 20 shows the quantitative 
S-galactosidase assay in liquid following cell growth in 
minimal medium without tryptophan. Standard errors are 
shown based on five independent experiments. (3- 

3 0 galactosidase activity is expressed as percent of maximal 

enzymatic activity (usually 100-200 units) obtained with 
pNEA alone. Fig. 21 shows the selection assay by cell 
growth on minimal medium deficient for both tryptophan 
and histidine and supplemented with 100 mM 3AT . This 3 AT 
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concentration was optimal for detecting differences in 
cell growth between various Rev derivatives. Fig. 22 
shows cell growth on minimal medium deficient for only 
tryptophan. (1), pNEA alone; (2), pNEARev <NES : SEQ ID 
5 N0:5: LPPLERLTL) ; (3), pNEAMlO (mutated NES : SEQ ID NO : 6 : 
LPPDLRLTL) ; (4), pNEARevA3 (residual NES: SEQ ID NO : 7 : 
LPPL) . 

EXAMPLE VII 

10 Identification of a functional NES in the capsid 

protein of a geminivirus . Tomato yellow leaf curl virus 
(TYLCV) is a constant threat to tomato growers around the 
world (Cohen et al . 1964) . TYLCV is a monopartite 
geminivirus containing only one genomic circular ssDNA 

15 encapsulated by the viral capsid protein (CP) (Davies et 
al . 1989). Upon infection, TYLCV is imported into the 
host plant cell nucleus where DNA replication, 
transcription, and virus assembly presumably take place 
(Navot et al . 1991). Whereas nuclear import of TYLCV is 

20 likely mediated by its NLS-bearing CP (Kunik et al . 

1998) , the mechanism by which this virus is exported from 
the nucleus for cell-to-cell movement and spread of 
infection remains unknown. Here, the pNEA-based nuclear 
export assay and histidine selection were used to 

2 5 demonstrate that, in addition to its NLS, TYLCV CP 

contains a NES functional in yeast. 

Fig. 23 shows that, similarly to Rev, a CP fusion 
substantially decreased histidine prototrophy, indicating 
reduction in HIS3 gene expression and, by implication, 

3 0 the presence of an active NES within CP. Next, the CP 

NES was mapped relative to its known NLS sequences which 
reside at the amino terminus (major NLS) and in the 
middle part of the protein (augmenting NLS) (Kunik et al . 
1998) . The CP amino terminus promoted efficient 
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expression of the reporter genes (data not shown) , 
suggesting that the CP NES is located within the deleted 
part of the protein, i.e. residues 38 to 260. Deletion 
of amino acid residues from 38 to 113 (CPAM mutant) , on 
5 the other hand, did not enhance HI S3 gene expression 

(Fig. 23) . This result indicates that the CP NES is not 
present in the middle portion of CP; in fact, deletion of 
the augmenting middle NLS apparently enhanced nuclear 
export of the mutant protein as compared to the full 

10 length CP (Fig. 23) . In contrast, removal of the CP 
carboxy terminus (residues 114 to 260, CPAC mutant) 
restored HIS3 expression (Fig. 23), suggesting that the 
deleted carboxy terminal region contains a functional NES 
signal. The differences in colony formation on the 

15 selective medium reflected changes in the expression of 
the HIS3 reporter because CP and all its mutants 
exhibited equal growth in the absence of histidine 
selection (Fig. 24) . Thus, CP likely contains two types 
of spatially distant targeting signals, amino terminal 

2 0 and middle NLSs and a carboxy terminal NES. 

More particularly, Figs. 23 and 24 show the 
detection of NES within TYLCV CP. Fig. 23 shows the 
selection assay by cell growth on minimal medium 
deficient for both tryptophan and histidine. Fig. 24 

25 shows cell growth on minimal medium deficient for only 
tryptophan. (1), pNEA alone; (2) pNEACP; (3) pNEACPAM; 
(4) pNEACPAC. 

Although preferred embodiments have been depicted 
30 and described in detail herein, it will be apparent to 
those skilled in the relevant art that various 
modifications, additions, substitutions and the like can 
be made without departing from the spirit of the 
invention and these are therefore considered to be within 
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the scope of the invention as defined in the claims which 
follow . 
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What Is Claimed Is: 

1 1 . A method of determining the presence of a 

2 nuclear localization signal in a protein of interest, the 

3 method comprising: 

4 selecting a host cell for use in the method, wherein 

5 the host cell contains a nucleus having nucleic acid 

6 encoding a reporter gene therein and wherein the host 

7 cell has a first level of expression of the reporter 

8 gene ,- 

9 identifying a DNA binding domain and an activation 

10 domain for the reporter gene; 

11 constructing a chimeric nucleic acid encoding a 

12 fusion protein comprising the DNA binding domain, the 

13 activation domain, and a protein of interest, wherein 

14 elements of the fusion protein other than the protein of 

15 interest have no nuclear localization signals, - 

16 introducing the chimeric nucleic acid into the host 

17 cell; and 

18 determining a second level of expression of the 

19 reporter gene to determine the presence of a nuclear 

20 localization signal in the protein of interest. 

1 2. The method of claim 1 wherein the host cell is 

2 a eukaryotic cell. 

1 3. The method of claim 1 wherein the host cell is 

2 a yeast cell. 

1 4 . The method of claim 1 wherein the reporter gene 

2 is a lacZ gene. 

1 5 . The method of claim 1 wherein the reporter gene 

2 is a selection marker gene. 



1 6. The method of claim 5 wherein the selection 

2 marker gene is a HIS3 gene. 



1 7. The method of claim 4 or 6 wherein the DNA 

2 binding domain is a LexA protein. 

1 8 . The method of claim 4 or 6 wherein the 

2 activation domain is a GAL4 activation domain. 

1 9. The method of claim 1 wherein the chimeric 

2 nucleic acid further comprises nucleic acid encoding a 

3 promoter to control expression of the fusion protein. 

1 10. The method of claim 9 wherein the promoter is 

2 an ADH1 promoter . 

1 11. A recombinant host cell comprising: 

2 a nucleus having nucleic acid encoding a reporter 

3 gene therein; and 

4 a chimeric nucleic acid encoding a fusion protein, 

5 the fusion protein comprising a DNA binding domain for 

6 the reporter gene, an activation domain for the reporter 

7 gene, and a protein of interest, wherein elements of the 

8 fusion protein other than the protein of interest have no 

9 nuclear localization signals. 

1 12 . The recombinant host cell of claim 11 wherein 

2 the host cell is a eukaryotic cell. 

1 13. The recombinant host cell of claim 11 wherein 

2 the host cell is a yeast cell. 



1 14. The recombinant host cell of claim 11 wherein 

2 the reporter gene is a lacZ gene. 



1 15. The recombinant host cell of claim 11 wherein 

2 the reporter gene is a selection marker gene. 

1 16. The recombinant host cell of claim 15 wherein 

2 the selection marker gene is a HIS3 gene. 

1 17. The recombinant host cell of claim 14 or 16 

2 wherein the DNA binding domain is a LexA protein. 

1 18. The recombinant host cell of claim 14 or 16 

2 wherein the activation domain is a GAL4 activation 

3 domain. 

1 19 . The recombinant host cell of claim 11 wherein 

2 the chimeric nucleic acid further comprises nucleic acid 

3 encoding a promoter to control expression of the fusion 

4 protein. 

1 20. The recombinant host cell of claim 19 wherein 

2 the promoter is an ADH1 promoter. 

1 21. A chimeric nucleic acid encoding a fusion 

2 protein, the fusion protein comprising a DNA binding 

3 domain for a reporter gene, an activation domain for the 

4 reporter gene, and a protein of interest, wherein 

5 elements of the fusion protein other than the protein of 

6 interest have no nuclear localization signals. 

1 22. The chimeric nucleic acid of claim 21 wherein 

2 the reporter gene is a lacZ gene. 



- 49 - 



1 23. The chimeric nucleic acid of claim 21 wherein 

2 the reporter gene is a selection marker gene. 

1 24. The chimeric nucleic acid of claim 23 wherein 

2 the selection marker gene is a HIS3 gene. 

1 25. The chimeric nucleic acid of claim 22 or 24 

2 wherein the DNA binding domain is a LexA protein. 

1 26. The chimeric nucleic acid of claim 22 or 24 

2 wherein the activation domain is a GAL4 activation 

3 domain. 

1 27. The chimeric nucleic acid of claim 21 further 

2 comprising nucleic acid encoding a promoter to control 

3 expression of the fusion protein. 

1 28. The chimeric nucleic acid of claim 27 wherein 

2 the promoter is an ADH1 promoter. 

1 29. A vector comprising the chimeric nucleic acid 

2 of claim 21. 

1 30. A kit comprising the vector of claim 29. 

1 31. The kit of claim 30 further comprising host 

2 cells which contain a nucleus having nucleic acid 

3 encoding the reporter gene therein. 



1 32. The kit of claim 31 further comprising a 

2 control vector. 
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1 33. A nucleic acid molecule encoding a modified 

2 LexA protein, wherein the modified LexA protein has no 

3 nuclear localization signal. 

1 34. The nucleic acid molecule of claim 33 wherein 

2 the nucleic acid molecule has a nucleotide sequence as 

3 shown in SEQ ID NO : 1 . 

1 35. The nucleic acid molecule of claim 33 wherein 

2 the nucleic acid molecule encodes an amino acid sequence 

3 as shown in SEQ ID NO : 2 . 

1 36. A modified LexA protein, wherein the modified 

2 LexA protein has no nuclear localization signal . 

1 37. The modified LexA protein of claim 36 wherein 

2 the protein has an amino acid sequence as shown in SEQ ID 

3 NO : 2 . 

1 3 8. A method of determining the presence of a 

2 nuclear export signal in a protein of interest, the 

3 method comprising: 

4 selecting host cells for use in the method, wherein 

5 each of the host cells contain a nucleus having nucleic 

6 acid encoding a reporter gene therein; 

7 identifying a DNA binding domain and an activation 

8 domain for the reporter gene; 

9 constructing a chimeric nucleic acid encoding a 

10 fusion protein comprising the DNA binding domain, the 

11 activation domain, and a nuclear localization signal, 

12 wherein elements of the fusion protein have no nuclear 

13 export signals; 

14 introducing the chimeric nucleic acid into one of 

15 the host cells; 
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16 determining a first level of expression of the 

17 reporter gene; 

18 constructing a second chimeric nucleic acid encoding 

19 a second fusion protein comprising the DNA binding 

20 domain, the activation domain, the nuclear localization 

21 signal, and a protein of interest ; 

22 introducing the second chimeric nucleic acid into 

23 another one of the host cells; and 

24 determining a second level of expression of the 

25 reporter gene to determine the presence of a nuclear 

26 export signal in the protein of interest. 

1 39. The method of claim 38 wherein the host cells 

2 are eukaryotic cells. 

1 40. The method of claim 38 wherein the host cells 

2 are yeast cells. 

1 41. The method of claim 3 8 wherein the reporter 

2 gene is a lacZ gene. 

1 42. The method of claim 38 wherein the reporter 

2 gene is a selection marker gene. 

1 43. The method of claim 42 wherein the selection 

2 marker gene is a HIS3 gene. 

1 44. The method of claim 38 wherein the nuclear 

2 localization signal is an SV40 nuclear localization 

3 signal . 



1 45. The method of claim 41 or 43 wherein the DNA 

2 binding domain is a LexA protein. 
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1 46. The method of claim 41 or 43 wherein the DNA 

2 binding domain and the nuclear localization signal are a 

3 LexA protein. 

1 47. The method of claim 41 or 43 wherein the 

2 activation domain is a GAL 4 activation domain. 

1 48. The method of claim 38 wherein the chimeric 

2 nucleic acid further comprises nucleic acid encoding a 

3 promoter to control expression of the fusion protein. 

1 49. The method of claim 3 8 wherein the second 

2 chimeric nucleic acid further comprises nucleic acid 

3 encoding a promoter to control expression of the second 

4 fusion protein. 

1 50. The method of claim 48 or 49 wherein the 

2 promoter is an ADH1 promoter. 

1 51. A recombinant host cell comprising: 

2 a nucleus having nucleic acid encoding a reporter 

3 gene therein; and 

4 a chimeric nucleic acid encoding a fusion protein, 

5 the fusion protein comprising a DNA binding domain for 

6 the reporter gene, an activation domain for the reporter 

7 gene, and a nuclear localization signal, wherein elements 

8 of the fusion protein have no nuclear export signals. 

1 52. The recombinant host cell of claim 51 wherein 

2 the fusion protein further comprises a protein of 

3 interest. 

1 53. The recombinant host cell of claim 51 wherein 

2 the host cell is a eukaryotic cell . 
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1 54. The recombinant host cell of claim 51 wherein 

2 the host cell is a yeast cell. 

1 55. The recombinant host cell of claim 51 wherein 

2 the reporter gene is a lacZ gene. 

1 56. The recombinant host cell of claim 51 wherein 

2 the reporter gene is a selection marker gene. 

1 57. The recombinant host cell of claim 56 wherein 

2 the selection marker gene is a HIS3 gene. 

1 58. The recombinant host cell of claim 51 wherein 

2 the nuclear localization signal is an SV4 0 nuclear 

3 localization signal. 

1 59. The recombinant host cell of claim 55 or 57 

2 wherein the DNA binding domain is a LexA protein. 

1 60. The recombinant host cell of claim 55 or 57 

2 wherein the DNA binding domain and the nuclear 

3 localization signal are a LexA protein. 

1 61. The recombinant host cell of claim 55 or 57 

2 wherein the activation domain is a GAL4 activation 

3 domain. 

1 62. The recombinant host cell of claim 51 wherein 

2 the chimeric nucleic acid further comprises nucleic acid 

3 encoding a promoter to control expression of the fusion 

4 protein. 
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1 63 . The recombinant host cell of claim 62 wherein 

2 the promoter is an ADH1 promoter. 

1 64 . A chimeric nucleic acid encoding a fusion 

2 protein, the fusion protein comprising a DNA binding 

3 domain for a reporter gene, an activation domain for the 

4 reporter gene, and a nuclear localization signal, wherein 

5 elements of the fusion protein have no nuclear export 

6 signals. 

1 65. The chimeric nucleic acid of claim 64 wherein 

2 the fusion protein further comprises a protein of 

3 interest . 

1 66. The chimeric nucleic acid of claim 64 wherein 

2 the nuclear localization signal is an SV40 nuclear 

3 localization signal. 

1 67. The chimeric nucleic acid of claim 64 wherein 

2 the DNA binding domain is a LexA protein. 

1 68. The chimeric nucleic acid of claim 64 wherein 

2 the DNA binding domain and the nuclear localization 

3 signal are a LexA protein. 

1 69. The chimeric nucleic acid of claim 64 wherein 

2 the activation domain is a GAL4 activation domain. 

1 70. The chimeric nucleic acid of claim 64 wherein 

2 the chimeric nucleic acid further comprises nucleic acid 

3 encoding a promoter to control expression of the fusion 

4 protein. 
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1 71. The chimeric nucleic acid of claim 70 wherein 

2 the promoter is an ADH1 promoter. 

1 72 . A vector comprising the chimeric nucleic acid 

2 of claim 64. 

1 73 . A kit comprising the vector of claim 72 . 

1 74 . The kit of claim 73 further comprising host 

2 cells which contain a nucleus having nucleic acid 

3 encoding the reporter gene therein. 

1 75. The kit of claim 74 wherein the reporter gene 

2 is a lacZ gene. 

1 76. The kit of claim 74 wherein the reporter gene 

2 is a selection marker gene. 

1 77. The kit of claim 76 wherein the selection 

2 marker gene is a HIS3 gene. 

1 78. The kit of claim 73 further comprising a 

2 control vector. 



GENETIC ASSAY FOR PROTEIN NUCLEAR TRANSPORT 



ABSTRACT OF THE DISCLOSURE 

The invention provides methods of determining the 
5 presence of a nuclear localization signal and/or the 
presence of a nuclear export signal in a protein of 
interest . The invention further provides chimeric 
nucleic acids and recombinant host cells for use in such 
methods. Additionally provided is a nucleic acid 

10 molecule encoding a modified LexA protein, wherein the 

modified LexA protein has no nuclear localization signal, 
as well as the modified LexA protein itself. In the 
nuclear import assay, if a protein of interest fused to a 
mLexA-Gal4AD hybrid contains a functional NLS, the fusion 

15 product will enter the yeast cell nucleus and activate 

the expression of reporter genes. In the nuclear export 
assay, if a protein of interest fused to a mLexA-SV40 
NLS-Gal4AD hybrid contains a functional NES, the fusion 
product localized to the cell nucleus will exit into the 

2 0 cytoplasm, decreasing the reporter gene expression 
levels . 
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Sequences Listing 
SEQ ID N0:1 

ATGAAAGCGTTAACGGCCAGGCAACAAGAGGTGTTTGATCTCATCCGTGATCACAT 
CAGCCAGACAGGTATGCCGCCGACGCGTGCGGAAATCGCGCAGCGTTTGGGGTTCG 
TTCCCCAAACGCGGCTGAAGAACATCTGAAGGCGCTGGCACGCAAAGGCGTTATTG 
AAATTGTTTCCGGCGCATCACGCGGGATTCGTCTGTTGCAGGAAGAGGAAGAAGGG 
TTGCCGCTGGTAGGTCGTGTGGCTGCCGGTGAACCACTTCTGGCGCAACAGCATAT 
TGAAGGTCATTATCAGGTCGATCCTTCCTTATTCAAGCCGAATGCTGATTTCCTGC 
TGCGCGTCAGCGGGATGTCGATGAAAGATATCGGCATTATGGATGGTGACTTGCTG 
GCAGTGCATAAAACTCAGGATGTACGTAACGGTCAGGTCGTTGTCGCACGTATTGA 
TGACGAAGTTACCGTTAAGgGCCTGgAAAAACAGGGCAATAAAGTCGAACTGTTGC 
CAGAAAATAGCGAGTTTAAACCAATTGTCGTTGACCTTCGTCAGCAGAGCTTCACC 
ATTGAAGGGCTGGCGGTTGGGGTTATTCGCAACGGCGACTGGCTGgaattc 



SEQ ID NO: 2 

MKALTARQQE VFDLIRDHIS QTGMPPTRAE IAQRLGFRSP NAAEEHLKAL 

ARKGVIEIVS GASRGIRLLQ EEEEGLPLVG RVAAGEPLLA QQHIEGHYQV 

DPSLFKPNAD FLLRVSGMSM KDIGIMDGDL LAVHKTQDVR NGQVWARID 

DEVTVKGLEK QGNKVELLPE NSEFKPIWD LRQQSFTIEG LAVGVIRNGD 
WLEF 



SEQ ID NO : 3 : PKKKRKV 

SEQ ID NO:4: KRXXXXXXXXXXKKKL 

SEQ ID NO : 5 : LPPLERLTL 

SEQ ID NO : 6 : LPPDLRLTL 

SEQ ID NO : 7 : LPPL 

SEQ ID NO : 8 : 

5 ' -GGGAA TTCAA TTTTA ATCAA AGTGG G-3 ' 

SEQ ID NO: 9: 

5'-GACGG ATCCC CGGGT ATTCG ATCTC TT-3 1 

SEQ ID NO: 10: 

5' -GGGAA TTCGA TAAAG CGGAA TTAAT TCCC-3' 

SEQ ID NO: 11: 

5'-GACGG ATCCC CGGGT ATTCG ATCTC TT-3' 

SEQ ID NO: 12 : 

5'-CCGTT AAGGG CCTGG AAAAA CAGGG-3 ' 

SEQ ID NO: 13 : 

5 ' -GTGAC TGGTG AGGCC TCAAC CAAGT C-3' 
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